Using Lexical Semantic Knowledge from Machine Readable Dictionaries for Domain Independent Language Modelling
نویسندگان
چکیده
Machine Readable Dictionaries (MRDs) have been used in a variety of language processing tasks including word sense disambiguation, text segmentation, information retrieval and information extraction. In this paper we describe the utilization of semantic knowledge acquired from an MRD for language modelling tasks in relation to speech recognition applications. A semantic model of language has been derived using the dictionary definitions in order to compute the semantic association between the words. The model is capable of capturing phenomena of latent semantic dependencies between the words in texts and reducing the language ambiguity by a considerable factor. The results of experiments suggest that the semantic model can improve the word recognition rates in “noisy-channel” applications. This research provides evidence that limited or incomplete knowledge from lexical resources such as MRDs can be useful for domain independent language modelling.
منابع مشابه
Metaphor as an Emergent Property of Machine-Readable Dictionaries
Previous computational attempts to handle nonliteral word usage have been restricted to "toy" systems that combine hand-coded lexicons with restricted sets of metaphor types that can be used to sanction specific classes of semantic subcategorization violations. These hand-coded efforts are unlikely to ever scale up to the rigors of real, free text. We describe an example-based approach to metap...
متن کاملAn Approach to Building the Hierarchical Element of a Lexical Knowledge Base from a Machine Readable Dictionary. an Approach to Building the Hierarchical Element of a Lexical Knowledge Base from a Machine Readable Dictionary 1
This abstract describes an approach to extracting taxonomies from machine readable dictionaries and using them to structure a lexical knowledge base which incorporates default inheritance. Taxonomy construction is based on an intuitive notion of the organisation of the substantial quantities of data in machine readable dictionaries which were developed for quite independent purposes. Our intent...
متن کاملExtracting Knowledge Bases from Machine- Readable Dictionaries : Have We Wasted Our Time?
Machine-readable versions of everyday dictionaries have been seen as a likely source of information for use in natural language processing because they contain an enormous amount of lexical and semantic knowledge. However, after 15 years of research, the results appear to be disappointing. No comprehensive evaluation of machine-readable dictionaries (MRDs) as a knowledge source has been made to...
متن کاملDRAFT Automatic Creation of Lexical Knowledge Bases : New Developments in Computational
Text processing technologies require increasing amounts of information about words and phrases to cope with the massive amounts of textual material available today. Information retrieval search engines provide greater and greater coverage, but do not provide a capability for identifying the specific content that is sought. Greater reliance is placed on natural language processing (NLP) technolo...
متن کاملMachine Readable Dictionaries: What Have We Learned, Where Do We Go?
Machine-readable versions of everyday dictionaries have been seen as a likely source of information for use in natural language processing because they contain an enormous amount of lexical and semantic knowledge. However, after fifteen years of research, the results appear to be disappointing. No comprehensive evaluation of machine-readable dictionaries (MRDs) as a knowledge source has been ma...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000